Blackwell Optimality for Controlled Diffusion Processes
نویسنده
چکیده
In this paper we study m-discount optimality (m ≥ −1) and Blackwell optimality for a general class of controlled (Markov) diffusion processes. To this end, a key step is to express the expected discounted reward function as a Laurent series, and then search certain control policies that lexicographically maximize themth coefficient of this series form = −1, 0, 1, . . . .This approach naturally leads tom-discount optimality and it gives Blackwell optimality in the limit as m → ∞.
منابع مشابه
Sensitive Discount Optimality via Nested Linear Programs for Ergodic Markov Decision Processes
In this paper we discuss the sensitive discount opti-mality for Markov decision processes. The n-discount optimality is a reened selective criterion, that is a generalization of the average optimality and the bias optimality. Our approach is based on the system of nested linear programs. In the last section we provide an algorithm for the computation of the Blackwell optimal policy. The n-disco...
متن کاملApplying Blackwell optimality: priority mean-payoff games as limits of multi-discounted games
We define and examine priority mean-payoff games — a natural extension of parity games. By adapting the notion of Blackwell optimality borrowed from the theory of Markov decision processes we show that priority mean-payoff games can be seen as a limit of special multi-discounted games.
متن کاملInfinite horizon asymptotic average optimality for large-scale parallel server networks
We study infinite-horizon asymptotic average optimality for parallel server networks with multiple classes of jobs and multiple server pools in the Halfin–Whitt regime. Three control formulations are considered: 1) minimizing the queueing and idleness cost, 2) minimizing the queueing cost under a constraints on idleness at each server pool, and 3) fairly allocating the idle servers among differ...
متن کاملDenumerable controlled Markov chains with strong average optimality criterion: Bounded & unbounded costs
This paper studies discrete-time nonlinear controlled stochastic systems, modeled by controlled Markov chains (CMC) with denumerable state space and compact action space, and with an infinite planning horizon. Recently, there has been a renewed interest in CMC with a long-run, expected average cost (AC) optimality criterion. A classical approach to study average optimality consists in formulati...
متن کاملBlackwell Optimality in Markov Decision Processes with Partial Observation by Dinah Rosenberg,
A Blackwell ε-optimal strategy in a Markov Decision Process is a strategy that is ε-optimal for every discount factor sufficiently close to 1. We prove the existence of Blackwell ε-optimal strategies in finite Markov Decision Processes with partial observation. 1. Introduction. A well-known result by Blackwell [3] states that, in any Markov Decision Process (MDP hereafter) with finitely many st...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2009